Extending XPath to Support Linguistic Queries

نویسندگان

  • Steven Bird
  • Yi Chen
  • Susan B. Davidson
  • Haejoong Lee
  • Yifeng Zheng
چکیده

Linguistic research and language technology development employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for linguistic data storage and queries. However, several important expressive features required for linguistic queries are missing in XPath. In this paper, we motivate and illustrate these features with a variety of linguistic queries. Then we define extensions to XPath which support linguistic tree queries, and describe an efficient query engine based on a novel labeling scheme. Experiments demonstrate that our language is not only sufficiently expressive for linguistic trees but also efficient for practical usage. Comments Proceedings of the Workshop on Programming Language Technologies for XML 2005 (PLAN-X 2005). This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/128 Extending XPath to Support Linguistic Queries Steven Bird , Yi Chen , Susan B. Davidson , Haejoong Lee , and Yifeng Zheng University of Pennsylvania, University of Melbourne sb,yicn,susan,haejoong,yifeng @cis.upenn.edu ABSTRACT Linguistic research and language technology development employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for linguistic data storage and queries. However, several important expressive features required for linguistic queries are missing in XPath. In this paper, we motivate and illustrate these features with a variety of linguistic queries. Then we define extensions to XPath which support linguistic tree queries, and describe an efficient query engine based on a novel labeling scheme. Experiments demonstrate that our language is not only sufficiently expressive for linguistic trees but also efficient for practical usage.Linguistic research and language technology development employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for linguistic data storage and queries. However, several important expressive features required for linguistic queries are missing in XPath. In this paper, we motivate and illustrate these features with a variety of linguistic queries. Then we define extensions to XPath which support linguistic tree queries, and describe an efficient query engine based on a novel labeling scheme. Experiments demonstrate that our language is not only sufficiently expressive for linguistic trees but also efficient for practical usage.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Filtering System to accelerate XML Access Control Enforcement

work is a proposal for an efficient yet secure XML access control enforcement which has been specifically designed to support fine-grained security policy. Without a structural summary, to ensure that user queries only use and return XML data the user is allowed to access can be quite inefficient in checking user queries and in traversing for XPath queries with the self-or-descendant axis(" // ...

متن کامل

LPath+: A First-Order Complete Language for Linguistic Tree Query

Large databases of linguistic annotations are used for testing linguistic hypotheses, and for training language processing models. Linguistic annotations are often syntactic or prosodic and typically have a tree structure. Our goal is to develop a language that can express a wide range of linguistic tree queries and has an efficient implementation. We argue that by adding some simple closures t...

متن کامل

Embedding Xpath Queries into SPARQL Queries

While XPath is an established query language developed by the W3C for XML, SPARQL is a new query language developed by the W3C for RDF data. Comparisons between the data models of XML and RDF and between the query languages XPath and SPARQL are missing. Since XML and XPath are earlier recommendations of the W3C than RDF and SPARQL, currently more XML data and XPath queries are used in applicati...

متن کامل

A New Structure for Accelerating XPath Location Steps

—Indexing technology is one of the kernel technologies in database management systems, especially for large datasets. XML (eXtensible Markup Language) has been successfully adopted as a universal data exchange format, particularly in the WWW environment. It is an important and basic task to efficiently implement XPath axes on XML documents. Using R*-tree, T. Grust proposed an interesting method...

متن کامل

Transforming XPath Queries for Bottom-Up Query Processing

The widespreading of XML as a content-description language on the Web requires advanced processing and management techniques for huge XML databases. XPath is a standard language for extracting the specified elements from XML documents, and its efficient support is one of the key issues in the current XML database technology. In this paper, we propose an XPath query transformation method for the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004